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ABSTRACT 

A  transform  method  is  presented  for  detecting  elements 
of  a  parametric  family  of  curves  in  noisy  point-set  images. 
Local  maxima  of  the  transform  in  the  parameter  space  will 
correspond  to  best  approximations  to  parts  of  the  data.  The 
cases  of  linear,  circular,  and  parabolic  approximations  are 
discussed. 
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SI.  Introduction 

Statistical  geometry  is  the  study  of  how  to  restore  pure 
geometric  objects  when  we  can  only  observe  deformed  versions 
of  them.  The  case  with  which  this  paper  is  concerned  is  when 
the  pure  geometric  objects  are  finite  subsets  of  curve  segments 
from  a  given  parametric  family.  If  we  call  the  pure  image  I, 
then  D  (I)  ,  the  deformed  version,  consists  of  a  (random)  point- 
subset  of  I  with  additive  noise  superimposed,  with  perhaps 
an  additional  point-process  as  background  noise. 

In  this  paper  we  shall  only  present  an  approximation 
method,  similar  to  the  Hough  transform  as  described  in  [2] 
and  to  the  chamfer  matching  in  [1] .  Additional  Hough  transform 
references  are  in  [4],  Distributional  considerations  will  be 
examined  later.  The  general  problem,  which  covers  a  wide  range 
of  distributional  assumptions,  appears  below. 

We  suppose  ilcr  is  some  parameter  space  with  F  =  {{(x,y): 

2  2 
f(£  ,x,y)  =  0,  (x,y)t  R  }:  £  a  family  of  curves  in  R  para¬ 

meterized  by  £  .  Let  (x^,y^) , . . . , (x^,y^)  be  a  set  of  N  points 

2 

of  R  that  arise  as  a  deformed  version  of  a  set  of  segments  of 
M  curves  parameterized  by  C1,...,CM  of  F.  We  shall  assume  that 
>  Y  )»•••»  (x^ ,  y^)  arises  as  a  deformation  of  a  point-process 
realization  of  the  segments,  plus  additive  noise. 

Let  £  denote  a  curve  of  family  F  (we  identify  a  curve  with 

its  parameter) .  A  segment  I  of  5  of  length  L  is  a  regular  curve 

segment  [3]  of  £  parameterized  by  arc-length.  That  is,  there 

o 

is  a  function  a:  [0,L]-*-R  satisfying 


(2)  a  is  one-to-one,  except  perhaps  at  the  end  points. 

(3)  The  following  identity  holds: 

a  =  ids  0  *  a  i  1. 

Let  P  =  {s^<S2<. . . <sN>  be  a  point-process  realization  of  the 
interval  [0,L]  consisting  of  N  points,  with  an  interpoint  dis¬ 
tance  A  that  is  a  random  variable  with  a  known  distribution. 

Let  e^,...,e  be  independent  noise  random  variables  with  values 

2  2 
in  R  ,  each  component  having  mean  0  and  variance  a  .  Then 

D(I)={ (xi,yi)=(a1(si) ,a2(si))+ei  :  i=l,...,N} 

is  the  randomly  deformed  version  of  the  image  I .  The  above  con¬ 
struction  is  extended  to  a  set  of  M  segments  from  F.  In  the 
rest  of  this  paper  we  assume  no  background  noise. 

In  Section  2  we  discuss  the  case  of  fitting  a  single  curve 
to  a  data  set  based  on  simple  least-squares  minimization.  Although 
this  direct  method  of  fitting  a  single  curve  can  be  extended  to 
fitting  several  curves  simultaneously,  difficulties  are  presented 
in  that  the  number  of  curves  present  in  an  image  is  usually  un¬ 
known,  and  that  the  optimization  solution  is  complicated.  In 
Section  3  we  define  a  more  robust  procedure  than  least-squares 
fitting  in  which  the  parameter  space  is  searched  for  local  maxima 
after  a  transform  method  similar  to  the  Hough  transform.  Finally 
the  computations  needed  to  carry  out  the  procedure  for  linear, 
parabolic,  and  circular  segments  are  given.  We  shall  not  consider 
in  this  paper  methods  for  searching  for  local  extrema. 


§2.  Fitting  a  single  curve  segment 

For  fitting  a  single  curve  segment  of  5  to  a  set  of  data 
points  { (x1,y1) , . . . ,  }  we  can  consider  the  least-squares 


minimization  problem 


£  [(xi-a.)2+(yi-6i)2] 


(  i  ,  (o^,^)  («n»3n))  i=l 

with  5 

f (  5  =  0  i=l,...,N. 

This  problem  is  solved  by  fixing  £  and  finding  the  solution 
(  5  )  »  Bi*  (  5  ) )  that  minimizes  (xi*ai)  2+  (y  -3.)  2  with 
f(£  , a.,3.)  =0  i=l,...,N.  By  the  Lagrange  Multiplier  Method 


we  get  the  "normal  equations"  for  a  >  (  5  ): 


ai_Xi+Xfa(  ^  '°‘i'6i)  =  0 


Bi-yi+Xfe( 5  ,ai# ei)  =  0 

f  U  ,a±r&i)  =  0 


I  Iff.  ^?0^> 

KT  ;  ■  te : 

r  T  3 


t  .  •  •  9 


•  f 


The  above  minimization  problem  is  now  reduced  to  .  ;.v3  i  . 

T  'Avm  1  :  or. 

II  '  t  ■  r  t  '  <’  ’■ > 

min  l  (xi-ai*( C  ))2+(y.-6i*(^  ))2  j  A*  j 

sfcfl  i=i  | 


a*  search  for  the  minimum  in  parameter  space. 

In  some  cases  solution  of  the  normal  equations  is  difficult, 
and  the  linear  approximation  to  the  constraints  of  the  minimiza¬ 
tion  problem  is  valid.  We  consider  the  linear  approximation 
f  (C,xi+6i,yi+ei)  = 


f  (C/Xi,yi)+7a^f  (C/X,y)  (£i) 


9  •  •  •  9 


If  ou*  =x^+6^,  |^*  =  yi+*:i  are  within  the  range  of  the  above 
approximation,  then  the  above  problem  may  bo  approximated  by 

N  2  2 

min  E  (6.  +e.  ) 

(C,(61,e1),...,(6N,eN))  i=l  1  1 


with 


Vcx,6  f  UfXi#yi>  (gi)  =  -  f(C,xi,yi)  i=l,...,N. 

For  a  fixed  and  i=l,...,N  we  define 

v.*(£)  =  min  (6.2+e.2) 

1  <Vei>  1  1 

with  fa(^,xi,yi) 6i+f g (C/Xi,yi) ei  =  -  f(C,xi,yi). 

It  can  be  shown  that 

2 


v±*  {£)  - 


f (C,xi,yi) 


Va,6£(5'Xi'yi>11 


2  i£  >*  0 


0  otherwise 

The  linearized  least-squares  problem  then  becomes 
N 

min  E  v . * (£) . 
itn  i=i  1 


The  above  methods  can  be  used  in  the  case  of  approximating 

a  set  of  M  curves  5 to  the  data.  We  could  make  the  same 

1  M  M 

calculations  as  above  but  with  g  ,€M,xry)  =  IT  f(£.,x,y)  in 

i  m  j=1  i 

place  of  f(£,x,y).  we  would  then  have  a  minimization  problem  in 

M 

Q  ,  which  is  a  rather  unpractical  situation.  In  the  next  section 


we  discuss  a  transform  method  which  results  in  searching  for  local 
extrema,  but  only  in  Q. 


§3,  Fitting  several  curve  segments  simultaneously 

In  this  section  we  consider  a  transform  from  the  data 

set  to  the  parameter  space  ft  in  which  search  for  local  extrema 

allows  one  to  extract  the  best  fitting  curve  segments  from  F. 

As  examples  of  the  computation  involved,  we  consider  three  cases 

linear,  parabolic,  and  circular  segments.  Note  that  multiple 

segments  from  the  same  curve  are  allowed.  Attention  is  paid 

2 

to  the  variance  o  of  the  noise  as  well  as  the  distribution  of 
A,  the  interpoint  distance  in  a  given  segment. 

Let  g(r)  be  a  decreasing  function  on  r*0  satisfying 

(1)  g(r)=0  r  ^c  for  some  c  >  0 

9  9 

(2)  smoothness  assumptions  on  g  as  may  be  required. 

A  function  g  satisfying  these  requirements  will  be  called  an 

influence  function.  [In  practice  we  choose  c  based  on  the 

y 

2 

variance  a  of  the  noise;  for  example 
P ( | X | <  c  )  =  .75 
2 

where  X  is  a  N(0,cr  )  r.v.  The  influence  function  could  take 

2  2 

the  form  g(r)  =  s(cg  -  r  )  for  0*r*Cg,  and  some  s>0] . 

We  consider  the  following  problem: 


max 

/  #  •  •  •  t  ( >  3^ ) ) 

with 

f(C,a.,e.)  = 


N 

I 

i=l 


g(/(xi"ai,2+(yi'3i)2) 


i=l 


0 


;  •  •  •  / 


N 


We  fix  and  solve  for  each  i=l,...f 


N 


i  x 


with  f(C»ai»@i)  =  0 


The  response  function  R(£)  is  defined  by 


N 


R(C)  =  £  g(v.*U)) 

i=l  1 


and  the  response  set  S{£)  by 

S(C)  =  {(ai*,Bi*)  i=l , . . . ,N:  vi*(C)<c  } 

We  note  that  g(v^*(£))  is  positive  only  if  the  distance  from 
the  data  point  (x^,y^)  to  the  curve  £  is  less  than  the  threshold 
value  c  as  defined  in  the  influence  function.  We  can  then 

g 

expect  R(£)  to  be  maximal  (locally)  at  the  £ ' s  that  fit  the 
largest  proportion  of  the  data  set  ^x^,y^)  , . . . ,  (x^,y^)  •  The 
response  sets  S (£)  at  the  maximal  £ ' s  give  the  points  of  the 
curve  £  that  best  approximate  the  image  before  the  additive 
noise.  Recalling  that  we  assumed  the  (a  priori)  knowledge  of 
the  interpoint  distance  A  we  can  use  this  as  a  basis  for  clustering 
the  response  set  S(£)  into  segments. 

In  the  next  section  we  examine  three  cases  of  paramerized 
families  of  curves:  linear,  parabolic,  and  circular. 


J 


Examples 

A.  Line  segments 

As  in  [2]  the  parameter  space  is  (9,Y) / 

(-00,00)  f  where  a  line  is  defined  by  the  equation 
cosf'fOx  +  sin(<P)y  -  y  =  0. 

For  each  data  point  (x.,y.)  we  thus  consider  the  problem 
minKa-x.^)  2  +  ( 6-yi)  2 ] 

with  cos(<p)a  +  sin(9)3  -  y  =  0. 

•» 

By  means  of  the  Lagrange  Multiplier  Method  we  can  show  that 
a^*=  sin2(«p)x^  -  sin  ('P)  cos  (9)  y^  +  ycos(9) 

,  0^*=  cos2(9)y^  -  sin  (CP)  cos  ( CP)  +  ycos(T) 

^  v^*=  Ix^cos^)  +  y^sin(£P)  -  y| 

In  pr  'Otice,  the  parameter  is  discretized.  For  each  data 
point  (x^,y^)  and  each  cp  from  the  discretized  set  we  compute 
gClx^ost^)  +  yisin(cP)  -  y|)  for  all  Y  such  that  Ix^osfcp)  + 
y^sin(cp)  -  y|  <  c^.  (thus  we  don’t  have  to  compute  a  value  for 
each  (<P,y)).  Summing  all  these  contributions  from  all  the 
data  points  gives  the  response  function  R(£) .  After  searching 
the  parameter  space  for  the  locally  maximal  E, ,  we  compute 
the  response  set  S{£).  Let  (a,  3)  S(0*  The  transformation 

/a\  /cos  (-S')  -sin(-cf')\  /a\  /  Y  \ 

^0/  ysin(-9)  cos  (-S')  J  \$ )  J 

-( 

V  cos  (S')  6  -  sin  (9)  a 


is  used  to  transform  the  response  set  S(£)  to  the  y-axis. 


A  clustering  procedure  on  R  may  then  be  used  to  find  the 
segments  of  4 • 

B.  Circular  segments 


We  consider  the  parameter  space  with  elements  (9,y,p) 

cP£[-TT/iO,  Y>0,  p>  c  specifying  the  curve 

y 

2  2 

(cos(cp)x  +  sin(9)y  -  y)  +  (cos(9)y  -  sin(9)x)  -  p  =  0 
(the  equation  of  the  curve  with  center  at  polar  coordinates 
(9,Y)  and  radius  p)  .  For  each  data  point  (x^,y^)  we  then  con 
sider  the  problem 

min[(a-xi)2  +  (B-y^2] 

with  (cos(9)a  +  sin(9)B  -  Y)  ^  +  (005(9)3  -  sin(9)a)2- 
It  is  easy  to  see  (geometrically)  that 

(x1-ycos (9) ) p 

ai*  =  ""  1  - - —  +  Ycos(9) 

(xi-Ycos(9) ) 2+(yi-Ysin(9) ) 2 

(y . -Ysin(9) ) p 

Bi*  =  — -  +  Y  sin (9) 

^(x^-ycos (9) ) 2+(yi~Ysin(9) ) 2 

Vi  ~~  ^  (xi~Ysin(9) ) 2+ (y^-Ysin(9) ) 2  ” 

Points  (x^,y^)  with  v^*  <  cg  give  a  positive  contribution 
to  the  response  function  R(9»Y»P)  and  thus 

N 

R(9,Y»P)  =  £  V.*(9,Y/P) 

i=l  1 

can  be  computed  over  the  parameter  space  by  fixing  (9,y)  and 

computing  g (v^* (9, y * P) )  for  which  p  satisfies  v^*(9,Y,P)  < 

c  ,  p>c  .  As  in  the  case  of  linear  segments,  after  the  local 

y  y 

maxima  of  R(9,y,p)  have  been  located,  the  response  sets 


S  ( 9,  y ,  p )  are  computed  for  those  particular  values,  and  a 
similar  clustering  {involving  a  decision  procedure  based  on 
the  distribution  of  A,  the  interpoint  distance)  can  be  imple¬ 
mented  to  extract  segments. 

C.  Parabolic  segments 

The  parameter  space  has  elements  (9,7,0)  9€[-Tr,rr),  Y>0, 

a> 0  where  a  parabola  is  defined  by 

2 

[cos(9)x  +  sin(9)y]  +  sin(9)x  -  cos(9)y  +  Y  =  0 , 

which  is  simply  the  fundamental  parabola 
2 

x  -y  =  0 

scaled  by  o,  increased  by  y  and  rotated  by  9.  Thus  for  a 
data  point  (x^,y^)  we  first  transform 

I 

x^  +-  cosCTjx^  +  sin{<p)y^ 
y^  -sin(T)xi  +  cos(9)yi  -  y 

and  consider  the  problem 

min(a-x|) ^  +  (3-y.p2 
2 

with  3=aa  . 

We  form  the  Lagrangian 

L  =  (a-x|)2  +  (6-y^)  2  -  MB-oa2). 

The  minimization  results  in  the  simultaneous  equations 
(a-x!)  +  Aa a  =  0 

i 

(B-y[)  -  A  =  0 

D  2 

3  =  aa 

Solving  these  we  get 

[l-2ay ! ]  x! 

=  +  - —  a - -  = 


h(a) 


2a 


2a 


0 


m 


Then  [ou*]  will  be  a  root  of  this  equation,  where  ' 
denotes  the  above  transformation.  Correspondingly  [3^*]'  = 

o([ai*]'.’?,  vi*  =  [a^*]  « )  2+(y^-  [gi*]  • )  2  ^“i*1  '  Wl11  be  the 

real  root  of  h(a)  =  0  that  minimizes  v^*)  . 

No  more  will  be  said  about  searching  for  the  local 

N 

maxima  of  R(V,y,cs)  =  Z  g  (v .  *  (9,  Y , a)  )  ,  but  once  the  response 

i=l  1 

set  {(a^*,3^*):  i=l,...,N}  corresponding  to  a  particular  para¬ 

meter  has  been  found,  it  will  be  necessary  to  perform  a  clus¬ 
tering  procedure  to  extract  segments.  We  first  transform  the 

response  set  to  {([ou*]',  [3^*]'}  using  the  transformation  given 

2 

above,  so  the  points  lie  on  3 =oa  .  By  transforming  the 
2 

parabola  0=oa  to  the  line  y=0  (preserving  arc  length)  we  can 

use  the  same  clustering  procedure  in  the  linear  case.  Such  a 

2 

transformation  [3]  is  given  by  (a,oa  )  (s,0)  where  s  = 

sgn  (a)  (l/4o)[2a  |  a  | /^2^2  +  In  (2a  |  a  | +/^^2  )]. 


p 


5.  Conclusion 


We  have  presented  in  this  paper  a  procedure  for  fitting 
curve  segments  to  a  data  set  when  the  curves  originate  from  a 
given  parametric  family.  The  method  involves  a  transform  into 
the  parameter  space  where  the  largest  local  maxima  indicate  the 
best  fitting  curves  to  the  data.  The  data  points  within  a 
threshold  neighborhood  of  a  given  curve  from  the  previous  step 
are  projected  onto  the  curve.  The  resulting  point  set  can  then 
be  clustered  to  extract  curve  segments.  The  parameter  spaces 
and  basic  computations  for  carrying  out  this  procedure  have 
been  done  for  linear,  circular,  and  parabolic  segments. 

A  survey  of  the  many  variants  of  the  original  Hough  Transform 
can  be  found  in  [4].  Basically,  in  these  techniques  the  additive 
noise  (due  to  scatter)  is  taken  care  of  in  the  quantization  of 
the  parameter  space.  As  noted  in  [5]  this  has  led  to  unsatisfac¬ 
tory  results  when  random  noise  is  not  removed  by  the  quantization 
procedure,  and  a  maximum  entropy  quantization  is  introduced  (in 
the  case  the  noise  distribution  is  known) .  In  these  methods, 
after  transforming  and  thresholding,  a  clustering  procedure 
is  performed  to  extract  the  unknown  curves.  The  method  developed 
in  this  paper  is  not  as  sensitive  to  how  the  parameter  space  is 
quantized,  and  replaces  clustering  with  the  problem  of  detecting 
local  maxima.  Also  included  in  [4]  is  a  discussion  of  detection 
of  circular  and  parabolic  segments,  and  the  relationship  of  the 
Hough  Transform  to  template  matching  in  the  case  where  all  the 
curves  to  be  detected  are  translations  of  each  other. 


A  generalized  Hough  translorm  is  presented  in  [6],  and  its 
equivalence  to  a  formulation  of  the  point  pattern  matching 
problem  is  given.  In  this  formulation  a  set  of  object  points 
0£Rn,  a  set  of  feature  points  PcRm,  and  a  set  of  parameterized 
functions  f^  :  Rn  -*■  Rm  are  given,  and  the  problem  is  to  find 
f^  that  maximizes  the  number  of  points  in  0  that  are  mapped 
onto  points  in  P  by  f ^ .  The  GHT  procedure  is  to  consider  all 
point  pairs  (o,p)  ofeO,  pfcP  and  compute  all  £  such  that  f^(o)  = 
p  (up  to  quantization  level) .  In  practice  an  array  of  accumu¬ 
lators  HT  (one  for  each  of  the  quantized  parameter  values)  is 
constructed,  and  HT(£)  is  incremented  by  unity,  where  £  is  an 
outcome  of  the  above  computation.  In  the  modified  GHT  we  would 
increment  an  accumulator  E,  by 
g  (d  (o,  {o:  f  ^  (o)  =  p})  ) 

where  d  is  the  infimum  of  the  Euclidean  distances  from  o  to  all 
points  o  satisfying  f^(o)  =  p,  and  g  is  an  influence  function  as 


defined  above. 
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